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Speech recogaHion method and system. 

(|7) The present invention relates to a speech 
recognition method and a system for a 
speech-con trotlable telephone in which a value 
is computed (2) for a reference word whh a 
speech recognizer (6) on the basts of a word 
uttsrad by a user, and a recognition resolution 
(6a.6b} Is made on the ttasis of said value. Prior 
to making said recognition resolution, rt is 
found out (3) if repetition of a previous word is 
in question, and If so, a new value Is computed 
(5) for the reference word on the basis of the 
valua computed by ttie speech recognizer and 
of a value in the memory, computed eaHter for 
the reference word, and a recognition resol- 
ution (6a,6b) Is made on the basis of said 
computed new value. 
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The present mventton relates to a speech recog- 
nftton method and Gystem, in partlc«4ar to a method 
and Gystem for a voloe-conlrollable telephone I'n 
which a vatua of a reference word is computed by a 
speech neoodnizer on the basis of a word uttered by s 
a user, and a recognition resolution te made on the ba- 
sis of that value. 

Telephones are liauatly provided wHh a handset 
which the user holds in his/her hands while speaking. 
This fa also the case when radioptKines such as mo- io 
bSe phones are used. Such a tel^hone feaves only 
one hand free, and thus can cause diTficultles when 
drfvlng. A solution to (hts problem cs a separate micro- 
phone placed In the car and a separata loudspeaker 
to be controlled at an appropriate volume and located is 
at an appit>priate distance from the user so that the 
user Is able to hear the other party dearly. Even with 
this design the user has to use one of his hands to 
make a call, Ue. in dialling the number of the other 
party or In responding to an Inc^omfng call, or in tar- 20 
mInaKng a call. 

For a telephone user to bo atrfa to concentrate on 
dnving, eo-called hands free telephones have been 
developed In which the operations can be voice con- 
trolled. Har0t>y. all of the telephone operations can l>e zs 
voice controlled, such as on/off switching, trans- 
mit/receive, speech volume control, phone number di- 
alling, telephone call answering, and the user Is thus 
able to concentrate on driving. The driver need not re- 
move his hands from the steering wheel nor take his 30 
eyes from the road; therefore, the hands free- 
telephone considerably increases road safety. 

A drawt>ack involved in the voice-controlled 
phono is that speech recognition is not entirely per- 
fect The background norso caused by the vehicular 35 
environment is high end thus, speech recognition t>e- 
comes more difficult Some endeavours have been 
made to market the speech recognition ability In as- 
sociation with mobile phones, but because of the un- 
reliability thereof the users' interest In voice- ^ 
controlled phones has t>een insignificant. The recog- 
nition accuracy of speech reoogntzers known in the 
art is not very good, particularly in adverse condi- 
tions, e,g. in a moving vehicle where the high back- 
ground noise substantially presents reliable word reo- 4S 
ognition. Incorrect recognition resolutions generally 
cause most of the inconveniences in implementing a 
user communications system because they might 
start undesirable operations, suc^ as termination of 
calls in the middle thereof, which is particularly incon- so 
ven'ient from the user's point of view. The most com- 
mon Consequences of erroneous speech interpreta- 
tions la dialling a wrong number. For this reason, user 
communloations are designed so that a recognition 
resolution is not made by a speech recognizer at all if 55 
It has not achieved a sufficient erisurance of the word 
uttered by the user, and in such instances, the user 
is usually requested to repeat the command uttered. 



Nearly all speech recognizers are based on the 
performance princfpfe that a word uttered by a user 
is compared by a rat tter complicated mat hod with ref- 
erence words previously stored In the memory of the 
speech recognizer. Speech reoognrzers generally 
compute a number corresponding to each reference 
word and indicating to what extent the word uttered by 
the user resembles the reference word. Finally, a rec- 
ognition resoluUon Is made on the basis of the nurrv 
bers so that the reference word which the uttered 
word most resemfciles is chosen for the resolutk>n. 
One of the most well known methods In the compar- 
ison t»etween the uttered word and the reference 
wofds Is the Dynamic Time Warping (DTW) method 
and the statistical Hidden Markow Model method 
(HMIM). 

In both the DTW and the HMM method an unfa- 
miliar speech pattern is compared with the known ref- 
erence patterns. In the Dynamic Time Warping, a 
speech pattern is divided Into a number of frames and 
the local distance between the speech part In each 
frame and the speech part equivalent to the reference 
pattern is computed. On the basis of the local distanc- 
es derived in this manner a minimum path is looked 
for between (he Initial and terminating point of the 
word by means of a DTW algorithm. Thus, with a Dy- 
namic Time Warping a distance can t>e obtained be- 
tween the uttered word and the reference words, (n 
the HMM method speech patterns are generated, 
and said speech pattern generation step is patterned 
with a status change pattern according to the Markov 
Method. Said status change pattern is thus HMM. 
Speech recognition for said received speech patterns 
now takes place by defining the observation prot>abll- 
fty for said speech patterns with the aid of the HMM 
pattern. Using the HMM in speech racognltion, an 
HMM pattern is first generated for each word to be 
recognized, Le. for each reference word. The HMM 
patterns are stored in the memory of the speech rec- 
ognizer. After the speech recognizer receives the 
speech pattern, an ol>servation probability is comput- 
ed for each HMM pattern stored in the memory, and 
as the lesutt of the recognition process, a word is pro- 
vided for the HMM pattern for which the highest ob- 
servation probability is obtained. In at her words, such 
probability is computed for each reference word with 
which it would be the wonf uttered by the user. The 
highest observation probability mentioned above de- 
scdbes the equality of the speech pattern received 
and the closest HMM pattern, i.e. the cJosest refer- 
ence speech pattern. 

Thus, in current systems the speech recognizer 
computes a certain number for the reference words 
on the basis of the word uttered by a user; in the DTW 
system the number Is the distance between words, 
and In the HMM method the numt>er indicates a prob- 
ability of the equality of the words. When using the 
HMM method, a given threshold probability is usually 
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defined for (he speech recogni2»ns which the most 
probable reference word has lo reach ki order to make 
a recognition re^ution. AMther facier df ^acting the 
recognition resolution oould be e.g. a drffarence be- 5 
tween the probabilities of the most probable word and 
the second most pmbable word; it is expected to be 
great enough so that a reoognttion resolution can be 
made. Wtien a recognition resolution is being made 
on the basts of the recognition pTotTability of the most io 
prot»able word, the erring probability is aliowed to be 
at most e.g. 0.1 . It Is therefore possible that when the 
background noise Is great for a reference word in the 
memory, such as reference word '"one*', the greatest 
prababtlity is obtained In every attempt on the t>asis t5 
of a command uttered by the user to be e.g. a 0.8 
probability compared with the ottier rererenoe words. 
Since the probability remains below the threshold 
probabaity 0.9^ this Is not accepted and the user may 
have to utter the oommand several times before ex* 20 
ceeding the reoognition protmbility limit, and the 
speech recognizer accepts the command although 
the probability may have t>een very dose to the ac- 
cept ed value. From the pointof view of the user, this 
is most disturbing. A correct recognition result can be 2s 
achieved with a first attempt using the present tech- 
nique quits often when the speed of the car Is below 
60 to dO km per hour depending on the sound insula- 
tion of the car and the user's manner of speaking. At 
higher speeds the performance of the recognizer, 30 
however, reduces very abruptly, and in most cars the 
speech recognizer no longer operates sufficiently re- 
liatily at spaeds over 1 00 km per hour to be regarded 
useful. It is particularly at said speeds at which the 
need to increase the safety in the traffic is greater 3$ 
than at lower speeds. 

in accordance with a first aspect of the invention 
there is provided speech recognition apparatus com- 
prising, comparing means for comparing a first word 
uttered by a user with an at least one predetermirted 40 
reference word, calculating means for calculating a 
value corresponding to the similarity between the 
first word uttered by theuserand tt*e at least one pre- 
determined reference word, and selecting means for 
selecting said value in accordance with a predeter- 4s 
mined criterion, characterized in that the calculating 
means Is capable of utjllsing said value in calculating 
a new value corresponding to the stmHaHty between 
a second word uttered by the userand the at least one 
reference word, 50 

in accordance with a second aspect of the inverv 
tion there \e provided a speech recognition method 
compn'stng, comparing a first word uttered by a user 
with an at leaat one predetermined raference word, 
calculating a value corresponding to the similarity bs- 55 
tween the first word uttered by the user and the at 
least one predetermined reference word, and select- 
ing said vslue In accordance with a predetermined 
criterion, characterised in that said value is used to 



cafculato a new value corresponding to t hie similarity 
between a second word uttered by the user and the 
at least one refansnce word. 

The invention has the advantage that a more re- 
HatHa recognition of words is possible, even If the sim- 
ilarity of uttered words and reference words is not 
high. 

In an ambodimentof the first and second aspects 
of the invention, there is provided more than one ref- 
erence word. This has the advantage that the user is 
a 1)1 e to utilise voice control in controlling apparatus 
comprising the invention and having more than one 
voice controlled function. 

In a preferred embodiment of the first and second 
aspeclB of the Invention, the second word uttered by 
the user Is the same as the first word uttered by the 
user. This has the advantage that a second calcula- 
tion is only performed If the second uttered word is 
the same as the first uttered word, thereby avoiding 
unnacesaary ciefay in recognising uttered words. 

In an alternate emlx>dtment of ttie first and sec- 
ond aspects of tiie invention said value Is only used 
in calcu1atlr>g a new value when the second word ut- 
tered by the user is the same as the first word uttered 
by the user. This has the advantage that unneces- 
sary calculations are avoided, and that a previous val- 
ue is only used to aid recognition of a repeated word 
uttered by the user. 

PreferatMy. the predetermined criterion Is that 
said value is less than a predetermined threshold val- 
ue, or alternatively the predetermined criterion is that 
the difference between said value and another value 
, corresponding to the similarity between the first v^Aard 
uttered by the user and another t&fGrenc9 word ex- 
ceeds a predetermined threshold value. This has the 
advantage that further utterances and calculations 
are necessary only when an uttered word cannot be 
reliably recognised, or when an uttered word Is similar 
to two different reference words. 

Suitably, repetition of the first word uttered by the 
user is requested when said value does not fulfil said 
pre determined criterion, which clearly indicates to 
the user that an uttered word has not been recog- 
nised and that It Is necessary for the user to repeat 
tfie word. 

In the method of the invention, a speech recog- 
nizer computes the recognition probabilities for refer- 
ence words and makes a recognition resolution If one 
of the probabilities exceeds a predetermined thresh^ 
old value; otherwise, the user Is requested to utter 
the word again and a recognition resolution is made 
thereon if the probability of one of the reference 
words exceeds a predetermined threshold value: 
otherwise, a new probability is computed utilizing the 
prevailing probability computed by the speech recog- 
nc^er, and a probability computed in one or more pre- 
ceding times, on the cxjndition that they are probabil- 
ities of one and the sama reference word, and that a 
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recogmtlon rasolutkin is made if said probabflity ex- 
ceeds a prodetermined tiKwhold valua. Unless (he 
prddatermlned threshold value is exceeded by the 
probabQity oomputad by the speech recogncter. the s 
computed probabaity is stored In the memory, the 
fs requested to utter the word once again, end 
the valua stored In the rnanx^iy ie used together with 
the subsequent probabil'rty / ptobabRities cxxnputed 
for the same word by the speech recognlzei; ffi order io 
to compute a new probability to be computed on the 
basis of said prat>abHltles (for making a recognition re- 
solution If, taJdng in consideration the preceding prob- 
abNItfes* a threshold probabllty Is achieved). There- 
after, when the speech recognizer computes a prot>- iS 
ability exceeding the threshold value, or it is reached 
by taking In consideration the preceding probatjilities, 
the memory is reset. Also In the instance in which a 
repetition of a previoua word Is In question, the mem- 
ory Is reset prior to a recoQnltjon resolutron. The mem- 
cry Is reset also wtten the power Is switched an in the 
apparatus and if an operation la interrupted. 

The Invention is descdbed In detaB below, by way 
of example only and with reference to the accompa- 
nying drawings. In which 29 
Rg. 1 shows a principle flow diagram of the steps 
to be taken in the method, and 
Fig. 2 shows a block diagram for Implementing 
the method in a system in which speech recogni- 
tion is used. 30 
In Fig. 1 the speech recognition method of the 
invention Is <:farlf led. The method la not dIfectJy asso- 
ciated wrtti the Internal method of the speech recog- 
nizer used in speech reoognltion. but by using the 
method achieving a recognition resolution is acceler* ss 
atad and the recognition precisian is impmved v^th- 
out paying attention to the properties of the actual 
speech recognizer. When the power Is switched on 1 
In the means, the memory is reset and an utterance 
2 from a user ts expected to take place, whereby the 40 
speech recognizer computes 2 probabilities for all ref- 
erence words and as a result of recognition. It yields 
tho reference word possessing the greatest probabil- 
ity, l_e. the reference word resembling most the word 
altered by ttte user. If the protiability of t he reference 
word does not exceed a predetermined threshold val- 
ue or the threshold value of the probabilities of the 
most probable and the second most probable word, 
in the present context called commonly the threshold 
values of speech recognition, it is found out 3 if the so 
word being examined is a repetition of the preceding 
word, tf repetition of such preceding word is not in 
question, the memory is reset 4a. When the user has 
not uttered the word more than once, the memory 
contains nothing during the first computeHon round, £5 
whereby no new probability is computed, either, but 
a recognition resolution is made 6a. and if no reliable 
recognition can be made 6b, a probability computed 
by Che speech recognizer is stored 7 In the memory. 



and a subsequent utterance from the user Is awaited. 
If, on the other hand, the word is a repetition of the 
previous word, a new piYsbability Is computed S mak- 
ing use in the computetlon of the pral>ability of d pro- 
ceding recognition attemptstored In the mwrmy, and 
en the basis thereof, a recognition resolutkin Is made 
6a. 6b, tf the new probability is obtained by means of 
the oomputetions 5 exceeds the threstiofd value, Le, 
a reliable recognition can t>e made 6b, the memory Is 
reset 4b, and a subsequent utteranM 2 from the user 
and a recognition result obtained 2 from the speech 
recognizer ara expected to take place, etc. if the new 
probability is below tt^ threshold value* so ttiat no re- 
liable recognition can be made, said new probabilfty 
is stored in the memory 7. and a subsequent utter- 
ance 2 of the user Is expected to teke place, etc. If one 
of the funcMons is Interrupted, the memory te neset, 
so that nothing that would distort a new recognition 
to be sterted after the interruption remains there. The 
method according to the Invention may also i>e imple- 
mented so that the recognition resolution 6a. 6b is 
made before findmg out 3 whether a repetition of the 
preceding word is In question or not Now, if the value 
computed by the speech recognizer for the repeated 
word exceeds the set threshold value, no such co 
mputatcon of a new probability needs to be done In 
which the values computed In preceding recognition 
attempts are taken into account. 

For carrying out the computation process, sever- 
al computation procedures can be developed, by us- 
ing which a more appropriate probability can be pro- 
vided through the use of the preceding probability in 
sucti computetions. However, the mast useful formu- 
la is the computation formula of conditional probabil- 
ity. For demonstrating the computation procedure 
used in the method, the utIlisaGon of conditional protv 
ability computation is described below in detail and In 
conjunction with the method of the invention. Such 
situation Is examined in which a user utters first a 
word A and then ward B after the user has been re- 
quested by the system to repeat that word. A speech 
recognizer computes e.g. the following prot>ab]Iities 
for both words A and B: 
P(A-1) = 0.7 (probability that A was "one") 
P(A=2) = 0.3 (probabflity that A was two*) 
P(B=i) = 0.8 (probability that B was -one') 
P(B=2) = 0.2 (probability that B was "two") 

If 0.9 has been set for the threshold value of the 
recognition resolution, no recognition reso|utk»n can 
be made concerning either recognltk»n. When we 
know that the user uttered the same word t>oth times, 
the refiabaity of the recognition can be Increased by 
utilizing the probability computed by one or more of 
the preceding and present recognitions in order to 
compute a new probability. This can be done e.g. with 
a conditional probability computation as follows: 

P(B = 1/A- B) = PiB=lJ^«^ ^ 
^ ' P(A^B) 
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P(B i and ((A ^ 1 and B c 1> Of (A « 2 and B ^ 2))) 
P(A-B) 

P((A^1 andB=a1>orffl = 1 and A =i 2 and B «^ 2)) 
P<A = B> 

^ P(A«°n andBip^D 

P((A«1 andB«1)Dr(Aa2QndB==2» 

_ 0,7 » 0-8 „ 0.S6 _ 0 903 

" 0.7 * 0.8 + 0.3 * 0.2 ■ 0.62 

The above oomputatJort in which a probability 
was computed for tho particular that the seMnd 
word, i.o. B, ia Nsne** on the condition that A is equal 
to B, in other words, the first word Is the aama as the 
second one, leads to a new probablity which In the 
present case exceeds the threshold value, and a rec- 
ognition resolution can be made. Even if the new 
pfobabMlity did not exceed the threshold value, It is, 
however^ tjetter than the Individual probability com- 
puted t>y the speech recognizer, and in this manner 
said newppotkablllty la stared In the memory and used 
in calculating a subsequent new prot>abIllty together 
with a subsequent probability computed by the 
speech leoognlzer. Itrs also found that the difference 
with the second most probable word increases. The 
atK>ve formula can be simplified by using onty the nti- 
merator Instead of the denominator and multiplied 
with an appropriate constant Y: 

P(B «=x|A=:B) = Y* P(A = X and B ^ x) =s 
Y • P{A = X) • P(B = x) 

Thus, If the user utters a word N times, the total 
protiability of each reference word r Is ot>tained as fol- 
lows: 

P(r) = Y • P(r,1) • P(r,2) • ... * P(r.N) 
where P(r,1) is the first utterance of reference word r« 
P(r,2) is the second and N is the last utterance there- 
of. In the atx3ve example a probability for a given ref- 
erence word was computed. In oonformance with the 
threshold criteda of speech recognition, the dfffer- 
enoe of the probabilities of two reference words (of 
the reference word which obtained the highest prob- 
ability and the second highest probabltity from the 
speech recognizer), automatically Increases and 
therefore the recognition reliability is Improved. It is 
simple to use the above methods of computation 
when the HMM metfiod is used in the speech recog- 
nizer because in such Instances it computes for each 
reference word the probability of the wotd uttered by 
the user. When using the OTW method, the compu- 
tation is not quite so straightforward because now a 
probability Is not computed for reference words In the 
speech reoognizer, tnjt a distance or a standard of 
how far the uttered word Is from each reference word. 

Therefore, in order to improve the recognition re- 
liability according to the method of making use of prior 
probabilities, the standard or distance has fifst to be 
transfbrfned into a probability. In DTW it Is thus pos- 
sil>le to describe with a number 0(r,i) to what extent 



aach raferenc^a word r resemt>leG an uttered word In 
a repeated Sme i. Hereby, a prot>ablllty can t>e com- 
puted from the numt>er with the aid of a function fQ. 
5 e.g. a nofvllnear function, as follows; 

0(r) = f{D(r,1).D{r,2) D{r.N» 

Alternatively an estimate erf the prol>ablity of a 
reference word from the result yielded by a DTW al- 
gorithm by means of an estimate gQ, whereby the re- 

10 suit computed by the speech recognizer <^n be oon- 
verted into a piobatHlity and the prot>at>aity of an l:nth 
repetition of reference word r Is now P<r,i) = 9(D(r,l)) 
whereby the number P(r,0. can ba used according to 
the method in calculating a new protvabHlty as descri- 

15 bed above. 

One way of tfnplementing the mattK>d of the In- 
vention In the speech recognition system Is shown in 
Fig. 2. With the method, the recognition accuracy of 
the speech reoognition system can be Improved in 

20 which the speech recognizer 8 provides recognition 
results. La. recognition probabHttles, which are fed to 
the recognition results' processing unit 9. Each rec- 
ognition result contains a Kst of the words to be rec- 
ognized, for each of which a probability (or other qual- 

2S ity factor) has been computed, describing to what ex- 
tent a word uttered by the user bears resemblance to 
each reference word. The reference words may have 
been stored in advance in the internal reference word 
memory of the speech recognizer d or the speech 

3o recognizer has been made atile to learn" words ut- 
tered by the user. However, this detail has no bearing 
from the point of view of the invention on how and 
when the reference words have t>een stored in the 
reference word memory, and the speech recognizer 

3S ' 8 need not have a reference word memory, ff a word 
cannot be recognized at a sufficient reliability, the 
user communications device 11 requests the user to 
repeat the word. In such Instance, the user commu- 
nications device 11 gives Information to the process- 
Ing t>Iock 9, of the reoognKJon results on whether a 
word is going to be repeated by the user or noL When 
the user communications device 11 informs the proc- 
essing uniL 9 that a repetition ofa word Is to bs expect- 
ed, the data stored in conjunction with the preceding 

4^ recognition attempt are accessed from the momory 
10. and new probabHities are computed for the refer- 
ence words in a manner regarding the preceding val- 
ues according to the invention. If no sufficiently relt- 
abrfe recognition can be made, even based on the new 

so probabilities, said new« higher calculated prot>abiHties 
are nevertheless stored In the memory 10. After a 
successful recognition has been made, the memory 
10 is reset. The memory Is also reset when data is 
sent to the processing t>lock 9 from the user commti- 

55 nications de^ce 11 that the next word to enter is not 
the same as the previous one. In practice, the system 
may be such that the processing block 9, the memory 
10 and the user communications t>lock 11 form part 
of the same processor. I.e. they are realized with the 

5 



EP 0 073 301 A1 



10 



aid of a procsesor. The proc«««or can be one ar^ 
ranged partlcufarty ^ the speech recognition eystein 
or it may be the main processor for a radiophone, Typ- 
Icelly, filso the speech reco0nizar 6 compridas a etQ- 
na] praoessor. 

WHh the aid of the prosant Invention^ the speech 
recognition accuracy can be improved although the 
b^c parformance of the speech recognizer itself is 
not improved. When the recognition accuracy la inv 
pro\/ed, the declalon making concerning tecognitlon 
l8 accelerated and a more user IHendly hands free 
phone can be kmpiemanted. The prasent Invenfion Is 
not limited to the fdrmuf a of the example shown In Fig. 
1 but various functions may also t>e performed k% a 
different order. 

In view of the foregoing description it will be evi- 
dent to a person ekiled in the art that various modi- 
fications may be made within the scope of the inven- 
tion. 

The scope of the present disclosure Includes any 
novel foature or combination of features disclosed 
therein either explicttiy or Impllcldy or any generalisa- 
tion tiiereof irrespective of whether or not it relates to 
the claimed Invention or mitigatos any or all of the 
problems addressed by the present Invention. The 
applicant hereby gh^es notice that new c^laims may be 
formulated to such features during the prosecution of 
this application or of any such further application de- 
rived therefrom. 



Claims 

1. Speech recognition apparatus comprising, 

comparing means for companng a first 
word uttered by a user with an at least one pre- 
determined reference word, 

calculating moans for calculating a value 
corresponding to the similarity between the first 
word uttered by the user and the at least one pre- 
determined rererence word^ and 

selecting means for selecting said value in 
accordance with a predetermined criterion, char- 
acterized in that the calculating means Is capable 
of utilising said value In calculating a new value 
corresponding to the similarity between a second 
word uttered by the user and the at least one ref- 
erence word. 

2. Speoch recognition apparatus according to claim 
1, comprising more than one prvdatermined ref- 
erence word. 

3. Speech recognition apparatus according to dalm 
1 or claim 2, wherein the second word uttered t>y 
the user Is the s^me as the first word uttered by 
the user. 



4. Speech racognltlon apparatus according to any 
preceding dalm, wherein the calculating means 
utflises said value in calculating a new value only 

s when the second word uttered by the user is t tie 

same as the first word uttered by the user. 

5. Speech leoognltion apparatus according to any 
preceding claim^ wtiarein the pradetermlned cri- 

10 terion is that said value Is less than a predeter- 

mined threshold value. 

6. Speech recognition apparatus acoonjing to any 
of claims 2 to 4, wherein the predetermined crt- 

19 tenon is that the dirference between said value 

and another value corresponding to the similarity 
between the first word uttered by the user and 
another reference word exceeds a predeter- 
mined threshold value. 

20 

7. Speech recognition apparatus acoordlng to daim 
5 or cfafm 6, wherein repetition of the first word 
uttered by the user Is requested when said value 
does not fuiril said predetermined cirllerion. 

2S 

fl. speech reoognition apparatus according to claim 
5 or daim 6, wtierein when said value fulfils said 
predetermined criterion the apparatus is reset 
and awaits a further utterance by the user. 

30 

9, Speech recognition apparatus acoordtng to any 
preceding daim, wherein said value and said new 
value are respective probabilities that the first ut- 
tered word and the second uttered word res pec- 
35 tivefy correspond to the at least one predeter- 

mined reference wcrd< 

V Speech recognition apparatus according to claim 
9, wherein said value and said new value are cat- 
40 culated by ubHsing a conditional prot>ablllty com- 

putation. 

11. Speech recognition apparatus according to any 
preceding claim, comprising a memory means 

^ (10) for storing said value and said new value. 

12. A Speech recognition method comprising, 

comparing a first word uttered by a user 
with an at least one predetermined reference 
543 word, 

calculating a value corresponding to the 
stmiiadty t>etween the first word uttered by the 
user and the at least one predetermined refer- 
ence word, and 
ss selecting said value In accordance with a 

predetermined criterion, characterised in that 
said value is used to calculate a new value cor- 
responding to the sUnilarCty between a second 
word uttered by the user and the at least one ref- 
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13. Amathod according to dalm 12 vyherein there are 
more than one predetermined reference wofd. 

14, Amethod aooording to dalm 1Z wherein the 80C- 
ond word uttered by the user Is the seme sa the 
first word uttered by the user. 

1 5« A method according to claim 12. wherein aaid val- 
ue Is only used in calculating a new value when 
the second word uttered ty the user Is the aanne 
as I he first word uttered by the user. 

1 6, Amethod according to any one of daima 1 2 to 1 5. 
wherein the predetermined criterion Is that asid 
value is lower than a predetermined threshold 
value. 
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20 



1 7. Amethod according to any one of dahns 13 to 15. 
wherein the predetermined <sriterton is that the 
difference iietween said value and another value 
corraaponding to the similarity between the first 
word uttered by the user and another refers nca 26 
word exceeds a predetermined threshold value. 

18. A method according to claim 16 or claim 17, 
wherein repetition of the first word uttarad by the 
user Is requested when said value does not fulfil 30 
s^id predet^mtned criterion. 

19. A method according to claim 16 or claim 17, 
wherein when said value fulfils said predeter- 
mined criterion the method is initialised and reset 3S 
to Its start point 

20. A method according to any preceding daim, 
wherein said value and said new value are re-' 
spectfve probabilities that the first uttered word 40 
and the second uttered word respectively corre- 
spond to the at least one predetermined refer- 
ence wofd, 

21 . A method acoordtng to claun 20. wherein said val- ^3 
ue and said new value are calculated utiising a 
conditional probablity computation. 
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